A New Database of Digits Extracted from Coins with Hard-to-Segment Foreground for Optical Character Recognition Evaluation

نویسندگان

  • Xingyu Pan
  • Laure Tougne
چکیده

Since the release date struck on a coin is important information of its monetary type, recognition of extracted digits may assist in identification of monetary types. However, digit images extracted from coins are challenging for conventional optical character recognition methods because the foreground of such digits has very often the same color as their background. In addition, other noises, including the wear of coin metal, make it more difficult to obtain a correct segmentation of the character shape. To address those challenges, this article presents the CoinNUMS database for automatic digit recognition. The database CoinNUMS, containing 3,006 digit images, is divided into three subsets. The first subset CoinNUMS_geni consists of 606 digit images manually cropped from high-resolution photographs of well-conserved coins from GENI coin photographs; the second subset CoinNUMS_pcgs_a consists of 1,200 digit images automatically extracted from a subset of the USA_Grading numismatic database containing coins in different quality; the last subset CoinNUMS_pcgs_m consists of 1,200 digit images manually extracted from the same coin photographs as CoinNUMS_pcgs_a. In CoinNUMS_pcgs_a and CoinNUMS_pcgs_m, the digit images are extracted from the release date. In CoinNUMS_geni, the digit images can come from the cropped date, the face value, or any other legends containing digits in the coin. To show the difficulty of these databases, we have tested recognition algorithms of the literature. The database and the results of the tested algorithms will be freely available on a dedicated website.1

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Modfied Self-organizing Map Neural Network to Recognize Multi-font Printed Persian Numerals (RESEARCH NOTE)

This paper proposes a new method to distinguish the printed digits, regardless of font and size, using neural networks.Unlike our proposed method, existing neural network based techniques are only able to recognize the trained fonts. These methods need a large database containing digits in various fonts. New fonts are often introduced to the public, which may not be truly recognized by the Opti...

متن کامل

Zone Based Features for Handwritten and Printed Mixed Kannada Digits Recognition

In the field of Optical Character Recognition (OCR), zoning is used to extract topological information from patterns. In this paper we propose Zone based features for recognition of the mixer of Handwritten and Printed Kannada Digits. A digit image is divided into 64 zones and pixel density is computed for each zone. This procedure is sequentially repeated for entire zone. Finally 64 features a...

متن کامل

Image Based Recognition of Ancient Coins

Illegal trade and theft of coins appears to be a major part of the illegal antiques market. Image based recognition of coins could substantially contribute to fight against it. Central component in the permanent identification and traceability of coins is the underlying classification and identification technology. However, currently available algorithms focus basically on the recognition of mo...

متن کامل

A new fuzzy geometric representation for online isolated character recognition

This paper introduces a new fuzzy representation for isolated character description. This representation maps a character from its original sequence of 2D coordinates into a fuzzy vector space that can thereafter serve as input to any artificial neural network classifier. Recognition experiments on isolated digits extracted from the UNIPEN database are then conducted to evaluate the performance...

متن کامل

Recognition of Handwritten Digits Using Multilayer Perceptrons

Neural networks are often used for pattern recognition. They prove to be a popular choice for OCR (Optical Character Recognition) systems, especially when dealing with the recognition of printed text. In this paper, multilayer perceptrons are used for the recognition of handwritten digits. The accuracy achieved proves that this application is a working prototype that can be further extended int...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Front. ICT

دوره 2017  شماره 

صفحات  -

تاریخ انتشار 2017